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Abstract 

We first review the derivation of the exact expression for the av- 
erage distance {rn) of the n-th neighbour of a reference point among 
a set of N random points distributed uniformly in a unit volume of a 
Z'-dimensional geometric space. Next we propose a 'mean-field' the- 
ory of (r„) and compare it with the exact result. The result of the 
'mean-field' theory is found to agree with the exact expression only in 
the limit D ^ oo and n ^ oo. Thus the 'mean- field' approximation 
is useless in this context. 



1 Introduction to the average n-th neighbour 
distance 



Consider (a large number) points distributed randomly and uniformly in 
a unit volume of a D-dimensional geometric space. A point is said to be the 
n-th neighbour of another (the reference point) if there are exactly n — 1 other 
points that are closer to the latter than the former. The average distance 
to the first neighbour is exactly known though originally calculated in 
three dimensions the method can be used for any finite dimension D : The 
probability distribution P{ri)dri of the first neighbour distance is defined by 
the probability of finding the first neighbour of a given reference point at a 
distance between ri and ri + dri : 

P(ri) dri = [1 - V{ri)f'^ {N - 1) dV(ri), (1) 

where V{ri) /r{D/2 + 1) is the volume of a D-dimensional 

hypersphere of radius ri centered at the reference point. The average first 
neighbour distance is defined as : 

(ri) = /"^ n Pin) dri, (2) 
Jo 

where R is the radius of a Z)-dimensional hypersphere of unit volume : 
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With the probability distribution of equation we get 



(3) 



in) = I r,[l-V{n)f-^{N -l)dV{n) 
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Now we address the general problem : What is the form of the aver- 
age n-th neighbour distance, for any finite n? Though this is a problem of 
purely geometric nature, the quantity {r^^^in)^ is relevant in physical and 
computational contexts; for example, in astrophysics we need to know the 
average distance between neighbouring stars distributed independently in a 



2 



homogeneous universe 0, and in the travehng salesman problem we need 
the average distance of the neighbours of each site for estimating the optimal 
path- length [Q. 

We proceed by extending the line of argument used in the case of the first 
neighbour [|l| to the n-th neighbour. The probability distribution of the n-th 
neighbour distance r„ is defined as the probablity P(r„)dr„ of finding the n- 
th neighbour of a given reference point at a distance between r„ and r„-|-dr„. 
This is a conditional probability because we look for the n-th neighbour of a 
point when its first (n — 1) neighbours have already been located : 



P(rry) dr^ 
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{N - n) dV- (r„) 



1 - ^(rn-i) 

The quantity l^(r„) is the volume of a D-dimensional hypersphere of radius 
r„ centered at the reference point. For a given reference point and its first 
n — 1 neighbours the average n-th neighbour distance is obtained as : 

(^n) (particular) = / r„ P (r„) dr„ (6) 

where, as before, R is the radius of a D-dimensional hypersphere of unit 
volume. The quantity (^^n) (particular) ^ function of a particular r„_i, r„_2, . . ., 
ri which are the distances of the first n — 1 neighbours of the given reference 
point. To calculate the ensemble average of r„ the quantity (^^n) (particular) 
must be averaged successively over the probability distributions of each of 
the first n — 1 neighbours : 

pR pR pR 

{rn) = / dri P(ri) / drs P(r2) ■ ■ ■ / dr„_2 P(r„_2) 

r dr„_iP(r„_i) dr„r„P(r„) (7) 
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where the probability distribution of the z-th neighbour is given by equation 
1^ with i replacing n. After a change in the order of the integrals in equation 

0: 
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(O = (N-l)(N-2)---(N-n) 
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X / dV{rn) [V^(r„)]^/^ [l-V{rn)] 
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(8) 



which gives the final form of the average n-th neighbour distance : 
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This resuh was reported in 0]. 

Next we consider fiuctuations 5r„ occuring in r^. This can be calculated 
exactly for any neighbour n ; the mean square deviation in r„ from its average 
value is given by : 
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which vanishes as D ^ oo. This suggests that the form of (r„) for large D 
can be arrived at by neglecting fiuctuations, an approach which corresponds 
to mean-field theories in statistical mechanics. 



2 A 'mean-field' theory 

By the following 'mean-field' argument we derive an expression for the aver- 
age n-th neighbour distance in large dimensions D. Since the average first 
neighbour distance can be found easily, we derive (r„) in terms of (ri). As 
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before we consider N (a large number of) random points distributed uni- 
formly within a unit volume of a D-dimensional geometric space. We choose 
any one of them as the reference point and locate its n-th neighbour. Ne- 
glecting fluctuations, which we can do for large D, the distance between 
them is r„(A^) ^ (r„(A^)). Keeping these two points fixed we change the 
number of points in the unit volume to Na by adding or removing points at 
random; the factor a is arbitrary to the extent that Na and na are natural 
numbers. Since the distribution of points is uniform, the hypersphere that 
had originally enclosed just n points will now contain na points. Therefore, 
what was originally the n-th neighbour of the reference point now becomes 
the na-th neighbour. Since the two points under consideration are fixed, so 
is the distance between them. Consequently, 

(r„(iV)) ^ (r„,(iVa)) . (11) 
Now we take a = 1/n, so that 

{rn{N)) ^ {n{N/n)), (12) 

which shows that the average n-th neighbour distance for a set of random 
points distributed uniformly is approximately given by the average distance 
for a depleted set of N/n random points in the same volume. Using the 
expression for {ri{N)) from equation ^ we get 
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Since the above argument neglects fluctuations the result of equation [13| 
ought to be exact in the limit D —>■ oo. 

The exact expression of (r„) for a finite dimension D is expected to reduce 



to the form of equation [T^ as D ^ oo where fluctuations do not affect. For 



large D equation ^ takes the following form : 
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(14) 



For the above expression to reduce to the form of equation |T3| the sum J2k=i i 



must be equal to logg n which happens only in the limit n — oo. Thus for any 
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finite n the exact result of equation fails to produce the fluctuation-free form 



of equation |T3| in large dimensions D. This shows that the 'mean-field' ap- 
proximation is useless in the present context. However the 'mean-field' ap- 
proach for (r„) may be used as a crude approximation in other distributions 
(non-uniform) of random points where an exact calculation is not possible 
beyond the first neighbour. 
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